Rede BIOFOCO: A distributed computation of Interpro Pfam, PROSITE and ProDom for protein annotation
Abstract
InterPro is a widely used tool for protein annotation in genome sequencing projects, but it demands a large amount of computation and is a highly time-consuming step. We present a strategy for executing the programs that use the Pfam, PROSITE and ProDom databases of InterPro in a distributed environment built on a Java-based messaging system. We developed a two-layer scheduling architecture for the distributed infrastructure, then ran experiments and analyzed the results. Our distributed system performed much better than the InterPro Pfam, PROSITE and ProDom analyses running on a centralized platform. This approach appears appropriate and promising for computationally demanding tools used in biological applications.
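The abstract does not say what the two scheduling layers are, so the following is only a minimal Python sketch of one plausible reading, not the authors' design. It assumes that the first layer routes work to a group of workers per database and that the second layer deals sequence batches round-robin within each group. The names `schedule`, `chunk`, `workers_by_db` and `batch_size` are hypothetical, and the messaging system itself is left out.

```python
from collections import defaultdict
from itertools import cycle

# The three InterPro analyses named in the abstract.
DATABASES = ("Pfam", "PROSITE", "ProDom")


def chunk(sequences, size):
    """Split a list of sequence IDs into consecutive batches of at most `size`."""
    return [sequences[i:i + size] for i in range(0, len(sequences), size)]


def schedule(sequences, workers_by_db, batch_size):
    """Return a plan {worker: [(database, batch), ...]}.

    Layer 1 (assumption): each database has its own group of workers.
    Layer 2 (assumption): batches are dealt round-robin inside that group.
    """
    plan = defaultdict(list)
    for db in DATABASES:
        workers = cycle(workers_by_db[db])  # round-robin over this group
        for batch in chunk(sequences, batch_size):
            plan[next(workers)].append((db, batch))
    return dict(plan)
```

With five sequences, a batch size of two and two hypothetical Pfam workers, the first worker would receive the first and third batches and the second worker the middle one. In a real deployment each `(database, batch)` pair would be sent as a message to the worker instead of being stored in a dictionary.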
Similar articles
NetAffx GPCR annotation database summary
Only approximately 51% of the human proteome can be annotated by the standard motif-based recognition systems [1]. These systems, currently aggregated into a single distributed system by InterPro [2], include PFAM, PRINTS, ProSite, ProDom, SMART, and SWISS-PROT+TrEMBL. PFAM consists of hidden Markov models based on hand-curated alignments of protein domains. PRINTS is a repository of protein fin...
The InterPro database, an integrated documentation resource for protein families, domains and functional sites
Signature databases are vital tools for identifying distant relationships in novel sequences and hence for inferring protein function. InterPro is an integrated documentation resource for protein families, domains and functional sites, which amalgamates the efforts of the PROSITE, PRINTS, Pfam and ProDom database projects. Each InterPro entry includes a functional description, annotation, liter...
InterPro, progress and status in 2005 (Mulder, Nicola)
InterPro, an integrated documentation resource of protein families, domains and functional sites, was created to integrate the major protein signature databases. Currently, it includes PROSITE, Pfam, PRINTS, ProDom, SMART, TIGRFAMs, PIRSF and SUPERFAMILY. Signatures are manually integrated into InterPro entries that are curated to provide biological and functional information. Annotation is pro...
The InterPro BioMart: federated query and web service access to the InterPro Resource
The InterPro BioMart provides users with query-optimized access to predictions of family classification, protein domains and functional sites, based on a broad spectrum of integrated computational models ('signatures') that are generated by the InterPro member databases: Gene3D, HAMAP, PANTHER, Pfam, PIRSF, PRINTS, ProDom, PROSITE, SMART, SUPERFAMILY and TIGRFAMs. These predictions are provided...
InterProScan - an integration platform for the signature-recognition methods in InterPro
InterProScan is a tool that scans given protein sequences against the protein signatures of the InterPro member databases, currently PROSITE, PRINTS, Pfam, ProDom and SMART. The number of signature databases and their associated scanning tools as well as the further refinement procedures make the problem complex. InterProScan is designed to be a scalable and extensible system with a...
Journal title:
- Genetics and Molecular Research: GMR
Volume 4, Issue 3
Publication date: 2004